PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Csa05g020550.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HD-ZIP
Protein Properties Length: 848 aa    MW: 92716.4 Da    pI: 6.4262
Description HD-ZIP family protein
Gene Model
Gene Model ID   Type    Source
Csa05g020550.1  genome  CSGP
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  57.4   2.4e-18  20     78   3          57
                    --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
        Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                    k  ++t+eq+e+Le+++ ++++ps  +r++L +++    +++ +q+kvWFqNrR +ek+
  Csa05g020550.1 20 KYVRYTPEQVEALERVYTECPKPSSLRRQQLIRECpilsNIEPKQIKVWFQNRRCREKQ 78
                    5678*****************************************************97 PP

2    START     191.1  5.4e-60  169    377  2          205
                     HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT..EEEEEE CS
           START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg..galqlm 94 
                     +aeea++e+++ka+ ++  Wv++  +++g++++ +++ s+++sg a+ra+g+v  +++  v+e+l+d++ W ++++ ++tl vi  g  g+++l+
  Csa05g020550.1 169 IAEEALAEFLSKATGTAVDWVQMIGMKPGPDSIGIVAISRNCSGIAARACGLVSLEPM-KVAEILKDRPSWLRDCRCVDTLSVIPAGngGTIELI 262
                     799*******************************************************.8888888888*****************999****** PP

                     EEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHH CS
           START  95 vaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllrslv 185
                     +++++a+++l++ Rdf+++Ry+ +l++g++v++++S++s +  p+   sss+vRae++pSg+li+p+++g+s +++v+hvdl+++++++++r+l+
  Csa05g020550.1 263 YTQMYAPTTLAAaRDFWTLRYSTCLEDGSYVVCERSLTSATGGPTgppSSSFVRAEMRPSGFLIRPCEGGGSILHIVDHVDLDAWSVPEVMRPLY 357
                     *****************************************9999999*********************************************** PP

                     HHHHHHHHHHHHHHTXXXXX CS
           START 186 ksglaegaktwvatlqrqce 205
                     +s+ + ++k++va+l++ ++
  Csa05g020550.1 358 ESSKILAQKMTVAALRHVRQ 377
                     ***************98765 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50071            15.321    15     79   IPR001356    Homeobox domain
SMART            SM00389            5.7E-15   17     83   IPR001356    Homeobox domain
SuperFamily      SSF46689           8.55E-17  19     83   IPR009057    Homeodomain-like
CDD              cd00086            4.07E-16  20     80   No hit       No description
Pfam             PF00046            6.4E-16   21     78   IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60   2.0E-18   22     78   IPR009057    Homeodomain-like
CDD              cd14686            5.44E-6   72     111  No hit       No description
PROSITE profile  PS50848            26.98     159    387  IPR002913    START domain
CDD              cd08875            1.15E-75  163    379  No hit       No description
Gene3D           G3DSA:3.30.530.20  2.2E-22   167    373  IPR023393    START-like domain
SMART            SM00234            1.1E-63   168    378  IPR002913    START domain
SuperFamily      SSF55961           3.57E-36  168    380  No hit       No description
Pfam             PF01852            2.9E-57   169    377  IPR002913    START domain
SuperFamily      SSF55961           1.04E-5   416    497  No hit       No description
SuperFamily      SSF55961           1.04E-5   529    604  No hit       No description
Pfam             PF08670            6.8E-49   701    846  IPR013978    MEKHLA
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
GO:0008289  Molecular Function  lipid binding
Sequence
Protein Sequence    Length: 848 aa
MMMVHTMNRE SPDKGLDSGK YVRYTPEQVE ALERVYTECP KPSSLRRQQL IRECPILSNI  60
EPKQIKVWFQ NRRCREKQRK EAARLQTVNR KLNAMNKLLM EENDRLQKQV SHLVYENGHM  120
KHQLHTASGT TTDNSCESVV VSGQQHQQQN PNPQHLQRDA NNPAGLLSIA EEALAEFLSK  180
ATGTAVDWVQ MIGMKPGPDS IGIVAISRNC SGIAARACGL VSLEPMKVAE ILKDRPSWLR  240
DCRCVDTLSV IPAGNGGTIE LIYTQMYAPT TLAAARDFWT LRYSTCLEDG SYVVCERSLT  300
SATGGPTGPP SSSFVRAEMR PSGFLIRPCE GGGSILHIVD HVDLDAWSVP EVMRPLYESS  360
KILAQKMTVA ALRHVRQIAQ ETSGEVQYGG GRQPAVLRTF SQRLCRGFND AVNGFVDDGW  420
TPMGSDGAED ITVMINLSPG KFGGAQYGNS FLPSFGSGVL CAKASMLLQN VPPAVLIRFL  480
REHRSEWADY GVDAYAAASL RASPFAVPCA RAGGFPSNQV ILPLAQTVEH EESLEVVRLE  540
GHAYSPEDMG LARDMYLLQL CSGVDENVVG GCAQLVFAPI DESFADDAPL LPSGFRVIPL  600
EHKSTPNGAT ANRTLDLASA LEGSTRQAGE ADPNGCNFRS VITIAFQFTF DNHSRDSVAS  660
MARQYVRSIV GSIQRVALAI APRPGSSISP ISVPTSPEAL TLVRWISRSY SLHTGADLFG  720
SDSQTSGDRL LHQLWNHTDA ILCCSLKTNA SPVFTFANQT GLDMLETTLV ALQDIMLDKT  780
LNESGRKALC SEFPKIMQQG YAHLPAGVCA SSMGRMVSYE QATVWKVLED DESNHCLAFM  840
FVNWSFV*
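
The Start and End coordinates in the Signature Domain table index into this sequence. The sketch below shows one way to recover the Homeobox segment (residues 20 to 78) from the formatted block. It assumes the coordinates are 1-based and inclusive, which matches the alignment shown for this record; the helper names clean_sequence and domain_slice are illustrative and not part of PlantTFDB. Only the first two rows of the block are needed to cover the Homeobox domain.

```python
import re

# First two rows of the Protein Sequence block, copied from this record.
# The trailing numbers are position counters, not residues.
RAW_BLOCK = """\
MMMVHTMNRE SPDKGLDSGK YVRYTPEQVE ALERVYTECP KPSSLRRQQL IRECPILSNI  60
EPKQIKVWFQ NRRCREKQRK EAARLQTVNR KLNAMNKLLM EENDRLQKQV SHLVYENGHM  120
"""


def clean_sequence(block: str) -> str:
    """Drop counters, whitespace, and any stop symbol; keep letters only."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end (assumed 1-based, inclusive)."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


sequence = clean_sequence(RAW_BLOCK)
homeobox = domain_slice(sequence, 20, 78)  # Homeobox row: Start 20, End 78
print(homeobox)
```

The printed segment can be compared against the Csa05g020550.1 row of the Homeobox alignment, where lowercase letters mark residues inserted relative to the HMM.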
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AJ441291  0.0      AJ441291.1 Arabidopsis thaliana mRNA for homeodomain-leucine zipper protein 14 (ATHB-14 gene).
GenBank  AY099791  0.0      AY099791.1 Arabidopsis thaliana homeodomain transcription factor (At2g34710) mRNA, complete cds.
GenBank  BT000335  0.0      BT000335.1 Arabidopsis thaliana homeodomain transcription factor (At2g34710) mRNA, complete cds.
Annotation -- Protein
Source     Hit ID             E-value  Description
Refseq     XP_010509720.1     0.0      PREDICTED: homeobox-leucine zipper protein ATHB-14
Swissprot  O04291             0.0      ATB14_ARATH; Homeobox-leucine zipper protein ATHB-14
TrEMBL     R0FTP8             0.0      R0FTP8_9BRAS; Uncharacterized protein
STRING     scaffold_402060.1  0.0      (Arabidopsis lyrata)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM4256              26           53
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G34710.1  0.0      HD-ZIP family protein